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In data sparse regions, remotely-sensed observations can be used to improve analyses, 
which in turn should lead to better forecasts. One such source comes from the 
Atmospheric Infrared Sounder (AIRS), which together with the Advanced Microwave 
Sounding Unit (AMSU), provides temperature and moisture profiles with an accuracy 
comparable to that of radionsondes. The purpose of this paper is to describe a procedure 
to optimally assimilate AIRS thermodynamic profiles— obtained from the version 5,0 
Earth Observing System (EOS) science team retrieval algorithm — into a regional 
configuration of the Weather Research and Forecasting (WRF) model using WRF-Var. 
The paper focuses on development of background error covariances for the regional 
domain and background field type, a methodology for ingesting AIRS profiles as separate 
over-land and over-water retrievals with different error characteristics, and utilization of 
level-by-level quality indicators to select only the highest quality data. The assessment of 
the impact of the AIRS profiles on WRF-Var analyses will focus on intelligent use of the 
quality indicators, optimized tuning of the WRF-Var, and comparison of analysis 
soundings to radiosondes. The analyses will be used to conduct a month-long series of 
regional forecasts over the continental U.S. The long-term impact of AIRS profiles on 
forecast will be assessed against verifying radiosonde and stage IV precipitation data. 


16 m Conference on Satellite Meteorology and Oceanogrphy, 11-15 January 2009, 
Phoenix, Arizona. 
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1. MOTIVATION 

• To improve weather forecast using AIRS temperature and moisture profiles 


*SPoRT, NASA/MSFC, Huntsville, AL 
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5. 48-H FORECAST IMPACT 



# 


2. AIRS PROFILES 

2.1. AIRS Profile Description 


Version 5 L2 temperature and moisture 
profiles over land and water 
~50 km spatial resolution; 14 levels 
between 1000 and 50 hPa 
Level-dependent quality indicators (Qls) 
determine maximum pressure level 
above which quality data should be 
assimilated (colored points in Fig. 1) 
AIRS soundings assimilated as separate 
land and water data types 



Fig. 1. Three- 
dimensional distribution 

2. 2. Ma trix 

• Separa^o^se^falhpn errors are used for the land and water soundings 

• Land Wd^^binet al. (2006) 

• E nils^jty es make land soundings less accurate 

• VatfdBtton for SGP which has same environment as SPoRT domain 

• Water: From AIRS instrument specs 

• Better soundings than land 

• Verified against TWP which has same environment as SPoRT domain 



Fig. 2. Background (black line) and observation (blue: AIRS water, green: AIRS land) 
errors for WRF-Var analysis. It is the ratio of the background errors that controls the 
magnitude of the analysis increment during the assimilation process. 

2. ANALYSIS/FORECAST MODEL 
2. 1 Model Configuration 

• WRF-Var V2.2.1 on CONUS domain (Fig. 1) 

• 12-km resolution, 450 x 360 horizontal grid, 37 vertical levels, top at 50 hPa 

• ARW initialized at 0000 UTC each day using 40-km NAM 

• 6-8h ARW forecast used as first-guess for WRF-Var; AIRS profiles assimilated 
at observation time 

2.2. Background Error (B) Matrix 

• A SPoRT B matrix was generated using gen_be with 37 days of WRF 
forecasts initialized at 00 and 12 UTC (NMC Method) 

• The horizontal transform is perform using recursive filter, and vertical 
transform is applied via EOF 


4. ANALYSIS IMPACT: 17 January 2007 




700 hPa WRF-Var Temperature and Moisture Analyses 


Temperature AIRS-Background 


Mixing Ratio AIRS-Background 
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Temperature Analysis-Background Mixing Ratio Analysis-Background 

Fig. 3. Analysis impact of AIRS at 700 hPa. The top row shows innovation fields (AIRS minus background) for 
a) temperature and b) mixing ratio. The corresponding analysis increments (WRF-Var minus background) are 
shown in c) and d). The “x” in a) denotes the location of the Greensboro, NC sounding described in Figure 4. 
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Temperature Results: s s 

• CNTL too cool in lower troposphere; too warm in upper troposphere 

• AIRS warms lower levels and cools upper levels reducing bias (Fig. 5a) 

• Daily bias statistics show temperature improvement in AIRS runs (Fig. 6a) 

Geopotential Height Results: 

• CNTL heights too low in mid- and upper troposphere; good near surface 

• AIRS raises heights at all levels degrading bias near surface but improving 
aloft (Fig. 5c) 

• Daily bias statistics show that AIRS forecasts are biased towards higher 
geopotential heights (Fig. 6b) 

6h Cumulative Precipitation Bias Score Results: 

• Bias score > 1 means over forecasting; bias score < 1 means under 
forcasting 

• Overall AIRS bias score is slightly larger than CNTL (AIRS=1 .51 ; CNTL=1 .48) 

• Daily statistics show mixed results 


> AIRS profiles cooler than background over FL and Great Lakes; warmer over SE 
US (Fig. 3a); they are generally moister except in upper mid-west and eastern 
Gulf of Mexico (Fig. 3b) 

> Absolute magnitude (±4°C for temperature, ±2 g/kg for mixing ratio) and spatial 
distribution of the differences are representative of other levels 

• Analysis increments shows similar temperature (Fig. 3c) and moisture (Fig. 3d) 
pattern compared to innovations 


6h Cumulative 



Fig. 4. Profiles of temperature (so 
and dew point (dashed) near 
Greensboro, NC (GSO) for 0800 l 
17 January 2007. The backgrounc 
(black) and WRF-Var (red) profiles 
are for the nearest grid point. The 
AIRS profile (blue) is for the highes 
quality retrieval closest to the grid 
point. The radiosonde (green) is a 
linear interpolation of the 0000 and 
1200 UTC soundings to 0800 UTC. 
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* Temperature and moisture 
soundings of AIRS-enhanced 
analysis lie between those of 
background and AIRS profiles 
below 300 hPa 

* AIRS correctly shows mid-level 
warming 

* Addition of AIRS produces 
analysis more comparable to 
radiosondes than background 
at mid- and lower troposphere 
(Fig. 4) 



Fig. 6. Time series of each day in the 37-day case study period for 48-hr forecasts of temperature bias (left), 
geopotential height bias (middle), and 6-h cumulative precipitation bias score (right). 

6. CONCLUSIONS/FUTURE WORK 

• Prudent assimilation of AIRS thermodynamic profiles and quality 
indicators can improve initial conditions for regional weather models 

• For this particular analysis, the AIRS-enhanced analysis more closely 
resembles radiosondes 

• Closer investigation of selective case study days 
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1. INTRODUCTION 

One of the challenges in numerical weather 
prediction is to provide forecast models with an 
initial state that realistically describes the 
atmosphere. Observations from satellites are 
one valuable option to improve the model initial 
state, especially in data sparse regions. NASA’s 
most state-of-the-art atmospheric profiler is the 
Atmospheric InfraRed Sounder (AIRS). AIRS 
radiances have been assimilated into global 
models yielding improvements in 500 hPa 
anomaly correlations out to 5-day forecasts (e.g. 
Le Marshall et al. 2006). However, for centers 
focusing on regional forecasting problems — 
such as NASA’s SPoRT Center (Goodman et al. 
2004) — AIRS profile impact on thermodynamic 
structures is a logical first step to using AIRS 
data. A methodology for assimilating AIRS 
profiles into the Weather Research and 
Forecasting (WRF) model using the three- 
dimensional variational (3DVAR) assimilation 
component of WRF (WRF-Var) is presented 
herein along with some preliminary forecast 
impact statistics from a 37-day case study 
period (17 January to 22 February 2007). 

2. AIRS PROFILES 

AIRS — coupled with the Advanced 
Microwave Sounding Unit (AMSU) — forms an 
integrated temperature and humidity sounding 
system. Due to its hyperspectral nature, AIRS 
can provide near-radiosonde-quality 
atmospheric temperature and moisture profiles 
with the ability to resolve some small-scale 
vertical features in both clear and partly cloudy 
scenes. AIRS has an instrument accuracy of IK 
in 1km layers for temperature and 15% in 2km 
layers for relative humidity. The spatial 
resolution of the profiles at nadir is 
approximately 50 km (Aumann et al. 2003). The 
superior vertical resolution and sounding 
accuracy make the instrument very appealing as 
a complement to radiosonde measurements in 
data sparse regions. For this study, standard 
version 5.0 soundings that contain 14 vertical 
levels between 1000 and 50 hPa are used. 


Each profile contains level-specific quality 
indicators (Qls) that define a specific level below 
which data are of questionable quality. This 
level is generally consistent with clouds and land 
effects (Susskind et al. 2006). In this study, the 
Qls are used to select the optimal data from 
each profile for assimilation. A three- 
dimensional distribution of AIRS profiles as 
determined by the Qls for 17 January 2007 is 
shown in Figure 1 . 



Fig. 1. Quality indicators for AIRS profiles assimilated at 
0800 UTC on 17 January 2007. The black points represent 
the highest quality data, and each colored point denotes the 
pressure level above which there are quality data. The red 
rectangle denotes the bounds of the WRF model domain. 

Poorly-defined infrared emissivity due to 
inhomogeneous land type can lead to degraded 
AIRS observations in over land soundings 
(Borbas 2007). Thus, over land observations 
generally have larger errors than those over 
water. In this study, both land and water 
soundings are assimilated and treated as two 
separate observation types in the analysis 
process. To accomplish this task, the WRF-Var 
source code was altered to accommodate AIRS- 
land and AIRS-water data sets with separate 
observation errors for the different observation 
types. Here, only the diagonal terms in the 
observation error matrix are filled. There are 
two strategies for assigning errors to 

observations from a satellite: 1) use the 

instrument accuracy or 2) use the validation of 
the instrument with dedicated in-situ 


observations. Tobin et al. (2006) validated the 
AIRS thermodynamic profiles against dedicated 
sondes in the Southern Great Plains (SGP; for 
land soundings) and the Tropical Western 
Pacific (TWP; for water soundings). Since the 
SPoRT WRF domain is mid-latitudinal, validation 
using SGP sondes is applicable for the land 
retrievals; thus, land soundings are defined 
using the Tobin et al. (2006) estimates. On the 
other hand, the TWP does not represent the 
over-water environment within the mid-latitudinal 
domain, so the AIRS instrument specifications 
are instead used for these soundings. 
Additionally, this arrangement allows the over- 
water soundings to have slightly lower error than 
the over-land soundings, which is consistent 
with AIRS sounding characteristics. Figure 2 
shows the observational errors for land (green 
line) and water (blue line) soundings (black line 
denotes background error; see section 3.3). 



Fig. 2. Background (black) errors and observation (blue: 
AIRS-water, green: AIRS-land) errors for WRF-Var analysis 
for temperature (left) and relative humidity (right). 

3. EXPERIMENTAL DESIGN 

3. 1. WRF Model Set-Up 

The forecast model used herein is the 
Weather Research and Forecasting (WRF; 
Skamarock, 2005) model, a next-generation 
mesoscale numerical weather prediction system 
designed to serve both operational forecasting 
and atmospheric research needs. It is a limited- 
area, non-hydrostatic primitive equation model 
with multiple physical parameterization options. 
The model domain used herein consists of a 450 
x 360 grid with 12-km spacing that covers the 
contiguous United States, Western Atlantic 
Ocean, and Gulf of Mexico (see Fig. 1). It has 
37 staggered terrain-following vertical levels with 
the top-level pressure at 50 hPa and finest 
resolution near the lower boundary. 

The WRF physical options used in this study 
consist of the Ferrier (new Eta) microphysics, 
the Kain-Fritsch cumulus convection scheme 


(Kain and Fritsch 1990), and the Mellor- 
Yamada-Janjic planetary boundary scheme 
(Mellor and Yamada 1982, Janjic 1990). The 
rapid radiative transfer model (RRTM, Mlawer et 
al 1997) and Dudhia scheme (Dudhia 1989) are 
used for longwave and shortwave radiation, 
respectively. The four-layer Noah land surface 
model (Chen and Dudhia 2001) provides the 
land surface physics. 

For this case study, each day’s WRF 
forecast is a cold start initialized using the 0000 
UTC North American Mesoscale (NAM) 
analyses. The boundary conditions are updated 
every 3 hours using the NAM forecasts. 
Although a NAM analysis is available at 0600 
UTC, using the 0000 UTC NAM as initial 
conditions allows the model to adjust 
dynamically prior to data assimilation. The WRF 
model is run from the initialization time to the 
time of the observations. This short-term WRF 
forecast is then used as the first guess field for 
the WRF-Var analysis. Because AIRS is a 
polar-orbiting satellite, the observation time for 
the AM overpass varies between 0600 and 0900 
UTC. The assimilation time is determined by the 
mean time of 2 AIRS data swaths over central 
and eastern North America (the westernmost 
overpass is omitted as these data will likely have 
minimal impact on short-term forecasts over the 
eastern U.S. at 48 hours). The WRF-Var 
analysis, then, re-initializes the WRF for the 
AIRS-assimilated runs (AIRS) and produces a 
48-h forecast. A control run (CNTL) is 
performed in the same manner except no data 
are assimilated. 

3.2. WRF-Var Implementation 

The WRF-Var is the data assimilation 
component of WRF (Barker et al., 2004), which 
works by minimizing a cost function to estimate 
the true state of the atmosphere using a 
previous forecast (background), observations, 
and their respective errors. These errors define 
the weighting at each grid point of the 
background and observations such that larger 
background error for a given variable will result 
in an analysis more closely resembling the 
observation (and vice versa). The observations 
are spread horizontally using a background error 
correlation length scale (Fig. 3), which is a 
function of grid point separation. The 
observations are spread vertically using an 
empirical orthogonal function (EOF) composition 
of the vertical component of the background 
error (Fig. 4). 



Fig. 3. Length scale for each control variable — a) 
streamfunction, b) velocity potential, c) temperature, and d) 
relative humidity — for WRF-Var. The length scale controls 
the horizontal spread of the observations during the 
assimilation process. 


WRF-Var package. The differences between 24 
and 12-h WRF forecasts for the 37-day case 
study period (17 January to 22 February 2007) 
were used to generate the B matrix. The 
background length scale and eigenvectors, 
which describe the horizontal and vertical 
spread of the observations, are shown in 
Figures 3 and 4 respectively. For both 
horizontal and vertical spread, lower vertical 
modes correspond to larger scales of spread. 
The average background error over the entire 
domain is shown back in Figure 2. 

The SPoRT B provides a temperature and 
moisture length scale that is on the order of the 
analysis grid spacing. The first vertical mode, 
which analyzes large-scale features, is close to 
the grid spacing (Fig. 3c and d). 

4. ANALYSIS IMPACT 
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Fig. 4. Eigenvectors for each control variable — a) 
streamfunction, b) velocity potential, c) temperature, and d) 
relative humidity — for WRF-Var. Each color represents a 
mode of the eigenvectors with smaller modes controlling 
larger scales of vertical spread. The eigenvectors control 
the vertical spread of the observations during the 
assimilation process. 

3.3 The SPoRT B Matrix 

Correct use of the background error 
covariance matrix (B matrix) is important in 
determining the appropriate weighting between 
the background and observations, as well as 
how information contained in observations is 
spread horizontally and vertically. Optimal 
analysis configuration desires background errors 
that are consistent with the domain/grid spacing, 
the model used for the background, and the 
season. A SPoRT B was calculated using the 
National Meteorological Center (NMC) method 
of averaged forecast differences (Parrish and 
Derber 1992) using the “gen_be” program in the 


The analysis impact of AIRS on 700 hPa 
temperature and mixing ratio for 17 January 
2007 is shown in Figure 5. Figures 5a and 5b 
show the innovations of a Barnes analysis of the 
AIRS temperature and mixing ratio observations 
and the background field. AIRS temperatures 
are warmer across the southeast U.S. but are 
cooler across south Florida and the Great 
Lakes. The AIRS observations are at times 4°C 
warmer or cooler than the background. AIRS 
moisture is generally moister than the 
background, except in the region from the 
Florida panhandle to coastal South Carolina. 

Figures 5c and 5d show the temperature 
and mixing ratio analysis increments for the 
AIRS analyses. The analysis increments depict 
±2.5°C changes in a similar pattern to the 
innovations in Figure 5a — further emphasizing 
the impact of the AIRS observations on the 
analysis. The moisture fields at 700 hPa show a 
similar impact. The 700 hPa results are 
representative of results at most other levels. 

To further illustrate the positive impact of the 
AIRS profiles, Figure 6 shows a comparison 
between a series of 0800 UTC soundings 
collocated near the Greensboro, NC (GSO) 
radiosonde site. The 0800 UTC radiosonde 
(green) is a linear interpolation of 0000 and 1200 
UTC soundings. The background field (black) is 
too cool and dry below 600 hPa compared to the 
radiosonde. The AIRS observation (blue) at 
0800 UTC is warmer and much moister than the 
background. As anticipated, when AIRS 
observations are assimilated, the analysis (red) 
becomes warmer and moister. For the 
temperature analysis, the inclusion of AIRS 



Fig. 5. Analysis impact of AIRS at 700 hPa for 0800 UTC 17 January 2007. The top row shows Innovation fields (AIRS-background) 
for a) temperature (°C) and b) mixing ratio (g/kg). The corresponding analysis increments (WRF-Var minus background) are shown 
in c) and d). The “x” in a) denotes the location of the Greensboro, NC (GSO) sounding described in Figure 6. 


produces a superior analysis to the background 
(compared to the radiosonde) below 600 hPa. 
The moisture analysis pulls the analysis in the 
correct direction at 700 hPa but moistens the 
analysis too much compared to the radiosonde. 

5. FORECAST IMPACTS 

The impact of the AIRS profiles on sensible 
parameters in the WRF forecasts is conducted 
by comparing parallel runs of the WRF: one with 
AIRS (AIRS) and one without (CNTL). 
Comparisons were made to 50 radiosondes east 
of 105°W, and statistics were compiled every 12 
hours for the daily 48-h forecasts from 17 
January through 22 February 2007. For 
precipitation, verification is done by comparing 
the model outputs with the 4-km NCEP Stage IV 
precipitation data. The 6-h cumulative 
precipitation data are mapped to the WRF model 
domain for direct grid comparison. For 
consistency with the radiosonde verification, 
only grid points that lie to the east of 105°W 
longitude are used. 



Fig. 6. Temperature (solid) and dew point (dashed) profiles 
near GSO for 0800 UTC 17 January 2007. The background 
(black) and WRF-Var (red) profiles are for the nearest grid 
point. The AIRS profile (blue) is for the highest-quality 
retrieval closest to the grid point. The radiosonde (green) is 
a linear interpolation to 0800 UTC. 





0 


Temperature 91a& (C) 


GeopotentJal Height 91as (m) 



<? 


o 

D 

m 


! 


<L 



Geopotontiol HoTght RMSE (m) 



CNTL AIRS 

Fig. 7. Verification statistics of 48-h forecasts compared to 50 radisondes east of 105°W. Biases (forecast minus observations) for 
a) temperature (°C) and b) geopotential height (m) as well as root mean square error (RMSE) for c) temperature (°C) and d) 
geopotential height (m) for 37 days between 17 January and 22 February 2007 are shown. Black lines represent the CNTL cases; 
red lines represent the AIRS cases. 


Figure 7 shows the cumulative statistics for 
the bias and root mean square error (RMSE) of 
temperature and geopotential height for all 37 
48-h forecasts. Differences shown in Figure 7 
are forecast minus observation such that biases 
less than zero indicate forecasts that have 
cooler temperatures and lower heights than the 
observations; biases greater than zero indicate 
forecasts that have warmer temperatures and 
higher heights. 

For temperature, the CNTL run is cooler in 
the lower troposphere and warmer in the upper 
troposphere compared to the radiosondes. Most 
levels show a reduction in bias with the AIRS 
runs warming the cool-biased lower troposphere 
and cooling the warm-biased upper troposphere. 
The only noticeable RMSE difference between 
the AIRS and the CNTL is a slight decrease in 
RMSE between 700 and 500 hPa. To further 
investigate the results shown in Figure 7, a time 
series of the forecast bias at 700 hPa is shown 
in Figure 8a. In Figure 7, the AIRS forecasts are 
improved by approximately 0.2°C. Combining 
the improved RMSE in Figure 7c with the lack of 


a consistently warmer bias (i.e. some AIRS 
forecasts are cooler than the CNTL and some 
AIRS forecasts are warmer than the CNTL) 
leads to a conclusion of real forecasted 
temperature improvement with the inclusion of 
AIRS thermodynamic profiles. 

The 48-h CNTL forecasts show geopotential 
heights that are too low compared to 
observations in the mid- to upper-troposphere. 
However, in the lower troposphere, the CNTL 
forecast is quite good. The AIRS profiles tend to 
raise the geopotential heights of the AIRS 
forecast throughout most of the troposphere, 
which works to reduce the bias in the mid- to 
upper troposphere (Fig. 7b). The height RMSE 
also shows slight improvement at the same 
levels. However, Figure 8 shows — via time 
series of 700 hPa heights — that the AIRS runs 
produce consistently higher heights. All but two 
forecasts show raised heights with the inclusion 
of AIRS profiles. This result combined with the 
result from Figure 7b leads to the conclusion 
that AIRS may be biasing the heights. This 
result may be due to changes in temperature not 



period for 48-hr forecasts of a) 700 hPa temperature bias, b) 
700 hPa geopotential height bias, and c) 6-hr cumulative 
precipitation bias score. 

being appropriately balanced within the model 
and bias propagation throughout the forecast. 
This will be a point of further investigation. 

Here, precipitation is verified using bias 
scores, which is equal to F/O, where F is the 


number of forecasted grid points that exceed 
any given precipitation threshold and O is the 
number of observed grid points that exceed that 
same threshold. Thus, the bias is a measure of 
the under- or over-forecasting of precipitation by 
the model. A bias score of 1 denotes perfect 
agreement in precipitation coverage between 
forecast and observations. Bias scores greater 
than 1 denote over forecasting; less than 1 
denote under forecasting. 

Overall, the average bias score for all 
measurable precipitation (greater than 2.54 mm) 
falling in 48-h forecasts for the 37 case study 
days is 1.48 for the CNTL runs and 1.51 for the 
AIRS runs. This means that AIRS slightly 
increases the precipitation in an already over- 
forecasted model. Figure 8c shows a time 
series of the 6-hr cumulative precipitation bias 
scores. For most days, the CNTL run over 
forecasts precipitation. Mostly, there are only 
small changes to the bias scores with the 
inclusion of AIRS observations. Further 
investigation into the dynamics of the 
precipitation development is necessary. 

6. CONCLUSIONS 

A methodology for assimilating v5.0 AIRS 
thermodynamic profiles into WRF-Var has been 
presented. A short-term WRF forecast was 
used as the background for the analysis, and 
quality indicators were used to select only the 
highest quality AIRS data, which were 
assimilated as separate land and water 
soundings. Results indicate that AIRS profiles 
produce an analysis closer to in situ 
observations than the background field, which 
should lead to improved initial conditions and 
better forecasts when used to initialize a model 
forecast. Forecasts from a 37-day case study 
period from the winter of 2007 were examined 
with mixed results. When compared against 
radiosonde observations, inclusion of AIRS 
thermodynamic profiles improved temperature 
bias, but produced consistently higher 
geopotential heights. Precipitation results are 
mixed for grid point comparisons with NCEP 
Stage IV precipitation data. More analysis of 
individual days from the case study is needed to 
better understand these results. 
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